Identifying the number of sources in speech mixtures with the Mean Shift algorithm

نویسندگان

  • David Ayllón
  • Cosme Llerena
چکیده

Blind Source Separation (BSS) of speech sources has been subject of study during many years, and it still remains largely open and unsolved. Traditional BSS methods based on statistical properties of the signals, as well as recent methods such as time-frequency masking, normally need to know in advance the number of sources in the mixture to perform the separation. Additionally, there are many applications where speech source enumeration can be very useful. Bearing in mind the need of automatic source enumeration of speech sources, this paper proposes a novel pruning algorithm based on the mean shift algorithm for speech separation. The results obtained identifying the number of sources in linear mixtures of 2, 3 and 4 sources support the suitability of the proposed algorithm. Key–Words: Blind Source Separation, Source Enumeration, Speech enhancement

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The effect of redesign workstation on Speech Interference Level (SIL) among bank tellers

Abstract Background: There is always an interaction between man and his environment that can be the cause of physical, physiological and psychological stress on people and also cause discomfort, annoyance, and have direct and indirect effects on their performance and productivity, health and safety. People in their workplace are exposed to many factors related to work activities and environmen...

متن کامل

Applying mean shift and motion detection approaches to hand tracking in sign language

Hand gesture recognition is very important to communicate in sign language. In this paper, an effective object tracking and hand gesture recognition method is proposed. This method is combination of two well-known approaches, the mean shift and the motion detection algorithm. The mean shift algorithm can track objects based on the color, then when hand passes the face occlusion happens. Several...

متن کامل

Blind Separation of L Sources from M Mixtures of Speech Signals

In many real-world applications of blind source separation, the number of mixture signals, M available for analysis often differs from the number of sources, L which may be present. In this paper, we extend a successful and efficient kurtosis maximization algorithm used in speech separation of two sources from two linear mixtures for use in problems with arbitrary numbers of sources and mixture...

متن کامل

اصلاح ردیاب انتقال متوسط برای ردگیری هدف با الگوی تابشی متغیر

The mean shift algorithm is one of the popular methods in visual tracking for non-rigid moving targets. Basically, it is able to locate repeatedly the central mode of a desirable target. Object representation in mean shift algorithm is based on its feature histogram within a non-oriented individual kernel mask. Truly, adjusting of the kernel scale is the most critical challenge in this method. ...

متن کامل

معرّفی الگوریتم جدید DESICA برای جداسازی کور سیگنال منابع گفتار در حالت پویا

Abstract: We consider a new scenario in blind speech separation problem in which the number and the features of active sources change with time in opposite to the previous methods in which all sources are active all the time. Accordingly, we propose the new DESICA algorithm for source separation which is a compound of the ICA and DESPRIT algorithms. In this algorithm, using the ICA, the separat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012